Aggregations over Generalized Hypertree Decompositions
نویسندگان
چکیده
Recent multiway join algorithms have been shown to be asymptotically faster than any sequence of pairwise joins. Generalized Hypertree Decompositions (GHDs), introduced by Gottlob et al., provide a query plan for utilizing these multiway join algorithms to solve conjunctive queries. It is interesting to extend these new algorithms and query plans to more sophisticated queries, e.g. those including aggregations. Inspired by the semi-ring annotations of Green et al. and the query language of Abo Khamis et al., we study a class of queries involving aggregations and joins over annotated relations. Our queries are rich enough to capture traditional notions of queries with aggregations, e.g. group by aggregations in SQL, as well as a wide range of problems outside traditional data processing, e.g. message passing in graphical models. We then extend existing algorithms and structures to answer these queries. Our work has three primary contributions. First, we recover and improve the state of the art by placing a simple, easy to check condition on permissible GHDs. Second, we describe how to construct these “valid” GHDs by stitching together (ordinary) GHDs of a series of hypergraphs. This allows us to apply a series of existing research on GHDs, including some that provides strongest-known runtime guarantees for both serial and distributed join algorithm. Finally, we extend our problem to compute queries with multiple different aggregations, delving into and characterizing completely the details of how aggregations and joins can be reordered. 1998 ACM Subject Classification H.2.4 Database Management Systems
منابع مشابه
New Heuristic Methods for Tree Decompositions and Generalized Hypertree Decompositions
Many practical hard problems in mathematics and computer science may be formulated as constraint satisfaction problems (CSPs). Tree and generalized hypertree decompositions are two important concepts which can be used for identifying and solving tractable classes of CSPs. Unfortunately the task of finding an optimal tree or generalized hypertree decomposition is an NP-complete problem. Thus man...
متن کاملGenetic algorithms for generalized hypertree decompositions
Many practical problems in mathematics and computer science may be formulated as constraint satisfaction problems (CSPs). Although CSPs are NP-hard in general, it has been proven that instances of CSPs may be solved efficiently, if they have generalized hypertree decompositions of small width. Unfortunately, finding a generalized hypertree decomposition of minimum width is an NP-hard problem. B...
متن کاملHeuristic Methods for Hypertree Decomposition
The literature provides several structural decomposition methods for identifying tractable subclasses of the constraint satisfaction problem. Generalized hypertree decomposition is the most general of such decomposition methods. Although the relationship to other structural decomposition methods has been thoroughly investigated, only little research has been done on efficient algorithms for com...
متن کاملUnderstanding Tractable Decompositions for Constraint Satisfaction
Constraint satisfaction problems (CSPs) are NP-complete in general, therefore it is important to identify tractable subclasses. A possible way to find such subclasses is to associate a hypergraph to the problem and impose restrictions on its structure. In this thesis we follow this direction. Among such structural properties, particularly important is acyclicity: it is well known that CSPs whos...
متن کاملTabu Search for Generalized Hypertree Decompositions
Many important real world problems can be formulated as Constraint Satisfaction Problems (CSPs). A CSP consists of a set of variables each with a domain of possible values, and a set of constraints on the allowed values for specified subsets of variables. A solution to CSP is the assignment of values to variables, such that no constraint is violated. CSPs include many NP-complete problems and a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1508.07532 شماره
صفحات -
تاریخ انتشار 2015